Data-driven modulation filter design under adverse acoustic conditions and using phonetic and syllabic units

Author

Michael L. Shire

Abstract

Constructing speech feature extraction methods that are robust to many types of corrupting acoustic environments remains a daunting task. It is therefore instructive to investigate which properties of speech carry the discriminative information for recognition under a variety of conditions. In this paper we describe results for generating RASTA-style modulation filters under a number of acoustic environments. We utilize Linear Discriminant Analysis, in a manner previously described by van Vuuren and Hermansky, to automatically generate discriminant filters for speech with artificially added background noise and reverberation. We also generate the filters using both phonetic and syllabic classification targets. Trends in the responses of the discriminant filters lend support to feature extraction design decisions employed by RASTA-PLP and Modulation-filtered Spectrogram features. Further, tests with added reverberation corroborate views on the perceptual stability of syllabic rates.
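The abstract gives no implementation details, so the following is only a minimal sketch of the general idea of deriving modulation filters with Linear Discriminant Analysis. It assumes that each training example is a fixed-length temporal trajectory of log energy from one frequency band, centered on a frame with a known class label (phonetic or syllabic), and that the leading discriminant vectors are read as FIR filters applied along time. The function names `lda_modulation_filters` and `modulation_response`, the ridge term, and the 100 Hz frame rate are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def lda_modulation_filters(trajectories, labels, n_filters=3, ridge=1e-6):
    """Derive discriminant FIR filters from labeled temporal trajectories.

    trajectories: array (n_examples, n_taps), one band's log-energy trajectory
                  per example (hypothetical input layout).
    labels:       array (n_examples,), class label of the center frame.
    Returns (filters, eigenvalues), with filters of shape (n_filters, n_taps).
    """
    X = np.asarray(trajectories, dtype=float)
    y = np.asarray(labels)
    n_taps = X.shape[1]
    grand_mean = X.mean(axis=0)

    # Accumulate within-class (Sw) and between-class (Sb) scatter matrices.
    Sw = np.zeros((n_taps, n_taps))
    Sb = np.zeros((n_taps, n_taps))
    for c in np.unique(y):
        Xc = X[y == c]
        class_mean = Xc.mean(axis=0)
        centered = Xc - class_mean
        Sw += centered.T @ centered
        diff = (class_mean - grand_mean)[:, None]
        Sb += len(Xc) * (diff @ diff.T)

    # Small ridge keeps Sw positive definite (an assumption, not from the paper).
    Sw += ridge * np.trace(Sw) / n_taps * np.eye(n_taps)

    # Solve Sb w = lambda Sw w by whitening with the Cholesky factor of Sw.
    L = np.linalg.cholesky(Sw)
    L_inv = np.linalg.inv(L)
    evals, evecs = np.linalg.eigh(L_inv @ Sb @ L_inv.T)
    order = np.argsort(evals)[::-1][:n_filters]
    W = L_inv.T @ evecs[:, order]      # columns are discriminant vectors
    W /= np.linalg.norm(W, axis=0)     # unit-norm filters
    return W.T, evals[order]


def modulation_response(fir, frame_rate_hz=100.0, n_fft=256):
    """Magnitude response of one filter over modulation frequency (Hz)."""
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / frame_rate_hz)
    return freqs, np.abs(np.fft.rfft(fir, n=n_fft))
```

Inspecting `modulation_response` for the first few filters is one way to see which modulation frequencies the discriminant analysis emphasizes for a given acoustic condition and target set.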


Similar resources

Syllable structure based phonetic units for context-dependent continuous Thai speech recognition

Choice of the phonetic units speech recognizer is a factor greatly affecting the system performance. Phonetic units are normally defined according to the acoustic properties of a speech. Nevertheless, with the limit of training data, too delicate acoustic properties are ignored. Syllable structure is one of the properties usually ignored in English phonetic units due to a lot of possible onsets...


Dual-route phonetic encoding: some acoustic evidence

Contemporary psycholinguistic models suggest that there may be two possible routes in phonetic encoding: a 'direct' route which uses stored syllabic units, and an 'indirect' route which relies on the on-line assembly of sub-syllabic units. The computationally more efficient direct route is likely to be used for high frequency words, whereas the indirect route is most likely to be used for novel...


Constrained Subword Units for Speaker Recognition

Phonetic features have been proposed to overcome performance degradation in spectral speaker recognition in difficult acoustic conditions. The harmful effect of those conditions, however, is not restricted to spectral systems but also affects the performance of the open-loop phone recognisers on which phonetic systems are based. In automatic speech recognition, larger subword units and the use ...


Title of dissertation: SPEECH RECOGNITION BASED ON PHONETIC FEATURES AND ACOUSTIC LANDMARKS

Title of dissertation: SPEECH RECOGNITION BASED ON PHONETIC FEATURES AND ACOUSTIC LANDMARKS Amit Juneja, Doctor of Philosophy, 2004 Dissertation directed by: Carol Espy-Wilson Department of Electrical and Computer Engineering A probabilistic and statistical framework is presented for automatic speech recognition based on a phonetic feature representation of speech sounds. In this acoustic-phone...


A probabilistic framework for landmark detection based on phonetic features for automatic speech recognition.

A probabilistic framework for a landmark-based approach to speech recognition is presented for obtaining multiple landmark sequences in continuous speech. The landmark detection module uses as input acoustic parameters (APs) that capture the acoustic correlates of some of the manner-based phonetic features. The landmarks include stop bursts, vowel onsets, syllabic peaks and dips, fricative onse...



Publication date: 1999
